Punctuation Annotation using Statistical Prosody Models
نویسندگان
چکیده
This paper is about the development of statistical models of prosodic features to generate linguistic meta-data for spoken language. In particular, we are concerned with automatically punctuating the output of a broadcast news speech recogniser. We present a statistical finite state model that combines prosodic, linguistic and punctuation class features. Experimental results are presented using the Hub–4 Broadcast News corpus, and in the light of our results we discuss the issue of a suitable method of evaluating the present task.
منابع مشابه
Automatic Punctuation Annotation in Czech Broadcast News Speech
This paper reports our initial experiments with automatic punctuation annotation from speech. We have focused on Czech broadcast news speech. The task can be defined as a classification of each inter-word boundary into one of target classes. We considered comma, sentence boundary and “no punctuation” as the target classes. We employed two statistical models – prosodic model and language model. ...
متن کاملSemantic-Based Image Retrial in the VQ Compressed Domain using Image Annotation Statistical Models
متن کامل
Sentence-Internal Prosody Does not Help Parsing the Way Punctuation Does
This paper investigates the usefulness of sentence-internal prosodic cues in syntactic parsing of transcribed speech. Intuitively, prosodic cues would seem to provide much the same information in speech as punctuation does in text, so we tried to incorporate them into our parser in much the same way as punctuation is. We compared the accuracy of a statistical parser on the LDC Switchboard treeb...
متن کاملText punctuation and prosody in Greek
A production experiment was carried out, in order to investigate text punctuation, including standard as well as ungrammatical (communicative) punctuation marks, and prosody relations. It is shown that punctuation is directly related to the duration of pauses, leading to the following structure: question mark>exclamation mark>full stop> colon>comma> ellipsis. Pitch resetting occurs in all cases...
متن کاملA Phonological Phrase Sequence Modelling Approach for Resource Efficient and Robust Real-Time Punctuation Recovery
For the automatic punctuation of Automatic Speech Recognition (ASR) output, both prosodic and text based features are used, often in combination. Pure prosody based approaches usually have low computation needs, introduce little latency (delay) and they are also more robust to ASR errors. Text based approaches usually yield better performance, they are however resource demanding (both regarding...
متن کامل